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CryEngine 2: Shading Overview 

Gam^Devel'LHpers 

“'"*"0 . Support shader model 2.0, up to 4.0 

• Completely dynamic lighting and shadows 

• Up to 4 point light sources per-pass 

• Wide range of known and DIY shading models 

• Some other fancy features 

• Deferred mix with multi-pass rendering approach 

• Average of 2K drawcalls per frame (~2M tris) 
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Water and Underwater Rendering 

• Intro water rendering video 
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Water and Underwater Rendering 

• Rendering believable looking water 

Underwater light-scattering [l] 

Water surface animation & tessellation 

Reflections/Refraction 

Shore/Foam 

Caustics and God-rays 

Camera and objects interaction with water 

Particles 


• How to make all this efficiently in a very complex 
and open ended world in a game like Crysis ? 
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No more flat water ! 
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® • 3D waves 

• Used statistical Tessendorf animation model [2] 

• Computed on CPU for a 64x64 grid 

• Upload results into a FP32 texture 

• Vertex displacement on GPU 

• Lower HW specs used sin waves sum 

• Additionally 4 moving normals maps layers 
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Surface Tessellation 
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Surface Tessellation 
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Camera aligned grid 

Keep detail nearby and in front of camera 

Problems 
Camera roll 


alaa ciiiaa nearby VS far away detail 

this approach in thpcnd ^ 
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Surface from a top perspective 
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Physics Interaction 
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• CPU animation is shared with Physics/Game 

• For lowspec machines, did same “math” as in 
vertex shader on CPU 

• Physics samples best water plane fitting object 
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Reflection 

Per frame, we had an avg of 2K drawcalls (~ 2M tris) 

• This really needed to be cheap - and look good 

• Problem: Reflections added about 1500 drawcalls 

• Draw calls minimization 

• Final average of 300 drawcalls for reflections 

• Total internal reflection also simulated 

• Half screen resolution RT 

■f r +■* 
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Reflection Update Trickery 


uaif VTLyc vcisjijici j _ _ ■ __ . ■ 

“'"*"0 Update Dependencies 

• Time 

• Camera orientation/position difference from 
previous camera orientation/position 

• Surface visibility ratio using HW occlusion queries 

• Multi-GPU systems need extra care to avoid out of 
sync reflection texture 


CMP 
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Anisotropic Reflection 


Blur final reflection texture vertically 
Also helps minimizing reflection aliasing 


Ik.. 







Refraction 
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No need to render scene again [3] 

Use current back-buffer as input texture 
Mask out everything above water surface 
Water depth > World depth = leaking 

Don't allow refraction texture offset for this case 


• Chromatic dispersion approx, for interesting look 

Scaled offset for R, G and B differently 

C '.1 
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Refraction Masking 




Chromatic Dispersion 
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Chromatic dispersion 
Exagerated for picture 
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Procedural Caustics 
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Extra rendering pass, can handle 
opaque/transparent 

Based on real sun direction projection 

Procedural composed using 3 layers 

Chromatic dispersion approximation 

Darken slightly to simulate wet surface 
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Procedural Caustics 
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Shore and Foam 
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Soft water intersection 

Shore blended based on surface depth 
distance to world depth 

Waves foam blended based on current height 
distance to maximum height 

Foam is composed of 2 moving layers with 
offsets perturbation by water bump 

Acceptable quality 
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Shore and Foam 
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Underwater God-Rays m 

• Essentially the same procedural caustics shader 

• Based on real sun direction projection 

• Projected into multiple planes in front of camera 

• Rendered into a 4x smaller than screen RT 

• Finally add to frame-buffer 







Underwater God-Rays 


Light Scattering 


+ Caustics 
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_+ God-Rays 

(exaggerated for picture) 
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Camera/Particles Interaction 




How to handle case where camera intersects an 
animated water surface ? 

Water droplets effect when coming out of water 
When inside water used a subtle distortion 
Water particles similar to soft-particles 


WWW.GDCONF.COM 


Things for the Future 
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Rolling waves didn’t made into final game 

Special animated water decal geometry 


• Water splashes 

• Surface interaction with shoreline 

• Dynamic surface interaction 

• Maybe in nearby future project ? © © © 
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Frozen Surfaces 

Intro frozen surfaces video 
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Frozen Surfaces 
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Huge Headache 

Haven’t found previous research on the subject 

Unique Alien Frozen World: How to make it ? 

Should it look realistic ? 

Or an “artistic” flash frozen world ? 

Make everything Frozen ? 

Dynamically ? 

Custom frozen assets ? 

Reuse assets ? 

Took us 4 iterations until final result 
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Lessons learned 

Final iteration 

Main focus was to make it visually interesting 

• Used real world images as reference this time 

• Minimize artist amount of work as much as 
possible 

• Impossible to make every single kind of object look 
realistically frozen {and good) with a single unified 
approach © 

• 1 week before hitting feature lock/alpha (gulp) 


WWW.GDCONF.COM 







Putting all together 








Accumulated snow on top 

Blend in snow depending on WS/OS normal z 

Frozen water droplets accumulated on side 

2 layers using different uv and offset bump scales to give 
impression of volume 

3D Perlin noise used for blending variation 

3 octaves and offsets warping to avoid repetitive patterns 

Glittering 

Used a 2D texture with random noise vectors 
Pow( saturate( dot(noise, viewVector), big value) 

If result > threshold, multiply by big value for hdr to kick in 
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Procedural Frozen 
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Reused assets 

Dynamic freezing possibility and with nice transition 
Didn’t gave artists more control than required 

Artists just press button to enable frozen 

Relatively cheap, rendering cost wise 
Visually interesting results 


• Only looks believable under good lighting conditions 
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Post-Effects 



Intro post-effects video 
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Post-Effects Overview 


'"'•■‘“*0 Post Effects Mantra: 

• Final rendering “make-up” 

• Minimal aliasing (for very-hi specs) 

• Never sacrifice quality over speed 

Unless you’re doing really crazy expensive stuff ! 

• Make it as subtle as possible 

But not less - or else average gamer will not notice it 


CMK 
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Camera Motion Blur (CMB) 
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LDR 

No bright streaks 
Washed out details 


CMP 
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Screen-space velocities 
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Render a sphere around camera 

Use previous/current camera transformation to 
compute delta vector 

Lerp between previous/current transformation by a 
shutter speed ratio ( n / frame delta ), to get correct 
previous camera matrix 

From previous/current positions compute velocity 
vector 

Can already accumulate N samples along 
velocity direction 

But will get constant blurring everywhere 
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Velocity Mask 

Gam^DeveL'Dpers 

“'"*"0 • Used depth to generate velocity mask 

• We let camera rotation override this mask 


• Depth is used to mask out nearby geometry 

If current pixel depth < nearby threshold write 0 

Value used for blending out blur from first person 
arms/weapons 

• Velocity mask is used later as a scale for 
motion blurring velocity offsets 

Blurring amount scales at distance now 

C ',1 P 
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CMB Vertex Shader Sample 
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vPos.xyz += vWorldViewPos .xyz; 

float4 vNewPos = mul(mViewProj , vPos); 
float4 vPrevPos = mul( mViewProjPrev, vPos ); 

OUT. HPosition = vNewPos; 

OUT.vCurr = HPosToScreenTC( vNewPos ); 

OUT.vPrev = HPosToScreenTC( vPrevPos ); 


CMP 
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CMB Pixel Shader Sample 

half4 cMidCurr = tex2Dproj (screenMap, IN.vCurr); 

half fDepth = tex2Dproj (depthMap, IN . vCurr) .x*NearFarClipDist .y; 

float2 vStart = IN. vCurr .xy/IN. vCurr .w; 

float2 vPrev = (IN. vPrev. xy/IN. vVPrev.w) * fScale; 

float2 vCurr = vStart * fScale; 


float2 vStep = vPrev - vCurr; 
float4 accum = 0; 


[unroll] 

for (float s= -1.0; s<l.0 ; s+= fweightstep ) { 
float2 tcFinal = vCurr. xy - vStep.xy * s; 

// Apply depth scaling/masking 

half fDepthMask = tex2D(screenMap, tcFinal). w; 

tcFinal += vStep.xy * (s - s * fDepthMask); 





i.* 


accum += tex2D(screenMap, tcFinal ); 

} 

accum *= fWeight; 

// Remove remaining scene bleeding from 1st player hands 
OUT. Color = lerp(cMidCurr, accum, saturate(fDepth-1.0) ) ; 
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Improving quality 













Iterative sampling approach 
First pass uses 8 samples 
Ping-pong results 

Second pass uses blurred results, this results 
in 8 * 8 samples (virtually 64) 

3rd = 512 samples, 4th = 4096, etc 


• High quality at relatively low cost 
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Iterative quality improve 
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Optimization strategies 

If no movement skip camera motion blur 
entirely 

Compare previous camera transformation with current 

Estimate required sample count based on 
camera translation/orientation velocity 

If sample count below certain threshold, skip 
Adjust pass/sample count accordingly 

This gave a nice performance boost 

Average case at 1280x1024 runs at ~ 1 ms on a G80 
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Object Motion Blur (OMB) 
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LDR ^ 

Bright streaks gone~ 
Washed out details 


M 
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HDR 

Bright Streaks 
Sharper Details 
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DX9 HW Skinning limitation 

Gam^Developers 

“'"*"0 • 256 vertex constant registers limit 

• Our characters have an average of 70 bones 
per drawcall 

• Each bone uses 2 registers = 140 registers 

• For motion blur we need previous frame bones 
transformations 

• 2 X 140 = 280 registers, bummer.. 

• Decided for DXIO only solution 
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Step 1: Velocity pass 

Render screen space velocity, surface depth 
and instance ID into a FP16 RT 


• If no movement, skip rigid/skinned geometry 

Compare previous transformation matrix with current 
Compare previous bone transformations with current 

• If per-pixel velocity below certain threshold 
write 0 to RT 

Can use this data for optimizing further 
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Why dilation needed ? 










step 2: Velocity Mask 




Used a 8x smaller render target 

Apply Gaussian blur to velocity length 

Result is a reasonable fast estimation of screen 
space needed for dilation and motion blurring 

Mask is used to efficiently skip pixels during 
dilation passes 
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step 3: Velocity Dilation 


GameDeveLopers 




Edge dilation 

Done using separated vertical and horizontal 
offsets 

4 passes total (2 for horizontal, 2 for vertical) 

If center offset has velocity or velocity mask is 0 
skip processing entirely 

Dilate if world depth > surface depth 
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Dilation shader sample 


float4 vCenterVZID = tex2D(tex0, tc.xy); 
float fCenterDepth = GetResampledDepth(texl, tc.xy); 
float fOffsetScale = tex2D(tex2, tc.xy). x; 

if( fOffsetScale == 0 | | dot ( vCenterVZID. xy, vCenterVZID. xy) ){ 
OUT. Color = f loat4( vCenterVZID. xyzw) ; 
return OUT; 


[unroll] 

for( int n = 0; n < nOffsets; n++ ) { 

floatZ tcLookup = tc.xy + vOffsets[n] .xy *vScrSizeRecip; 
float4 vCurrVZiD = tex2Dlod(tex0, float4( tcLookup , 0, 0)); 

float fDepthCmp = saturate( fCenterDepth- vCurrVZID.z ); 
fDepthCmp *= dot(vCurrVZID.xy, vCurrVZID.xy) ; 
fDepthCmp *= Dilated. z == 0; 

if( fDepthCmp) 

Dilated = vCurrVZID; 


} 
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Velocity Dilation 


Original I Step 1 


Step 2 











Final Step: Motion Blurring 

. Accumulate N samples along velocity direction 

• Can use surface ID to avoid leaking 

Extra lookup.. 

• Clamp velocity to acceptable range 

Very important to avoid ugiy resuits, especialiy with 
jerky animations 

• Divide accumulated results by sample count 
and lerp between blurred/non blurred 

Using alpha channel blend mask 
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OMB Pixel Shader Sample 

float4 cScreen = tex2Dlod(tex0, float4(tc .xy, 0, 0)); 
float4 cVelocity = tex2Dlod(texl, float4(tc .xy, 0, 0)); 
OUT. Color = cScreen; 

if( dot (cVelocity .xy, cVelocity .xy) < f Threshold ) 
return OUT; 


float4 cAccum = 0; 

float fLen = length (cVelocity .xy) ; 

if( fLen ) cVelocity. xy /= fLen; 

cVelocity. xy *= min (fLen, vMaxRange) //Clamp velocity to MaxRange 


[unroll] 

for (float i = 0; i < nSamples; i++) { 

float 2 tcMB = cVelocity * ((i * fRecipSamples) -0.5) + tc; 
float4 cCurr = tex2Dlod(tex0, float4(tcMB, 0, 0)); 
cAccum += float4(cCurr .xyz, saturate(10000 * cCurr.w)); 

} 

if( cAccum. w ) { // Blend with scene 

cAccum *= fRecipSamples; 

OUT. Color = lerp(cScreen, cAccum, saturate( cAccum. w * 2) ); 
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Blend with scene 
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Iterative quality improve 
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Object Motion Blur 

Object motion blur with per-pixel dilation 
Not perfect, but good quality results for most cases 
Geometry independent 
Problematic with alpha blending 


• Future improvements / challenges 

Self-motion blurring 

Multiple overlapping geometry + motion blurring 
Could use adaptive sample count for blurring 
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Sun Shafts [sj 

aka Crepuscular Rays/God Rays/Sun beams/Ropes of Maui, 
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W 
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Screen Space Sun Shafts 

Generate depth mask 

Mask = 1 - normalized scene depth 

Perform local radial blur on depth mask 

Compute sun position in screen space 

Blur vector = sun position - current pixel position 

Iterative quality improvement 

Used 3 passes (virtually = 512 samples) 

RGB = sun-shafts, A = vol. fog shadow aprox 

Compose with scene 

Sun-shafts = additive blending 
Fog-shadows = soft-blend [5] 


WWW.GDCONF.COM 





Depth Mask Radial Blurring 


CMP 
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Sun Shafts: Results 
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Color Grading 
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Artist control for final image touches 

Depending on “time of day” in game 
Night mood != Day mood, etc 

Usual saturation, contrast, brightness controls 
and color levels like in Far Cry [6] 

Image sharpening through extrapolation [9] 

Selective color correction 

Limited to a single color correction 
Photo Filter 
Grain filter 
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Image Sharpening 
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Selective Color Correction 

Color range based on Euclidian distance 

ColorRange = saturate(l - length( src - col.xyz) ); 

Color correction done in CMYK color space [8] 

c = lerp( c, clamp(c + dst_c, -1, 1), ColorRange); 

Finally blend between original and correct color 

Orig =lerp( Orig, CMYKtoRGB( c), ColorRange); 



Color Range 



Photo Filter 

Blend entire image into a different color mood 
Artist defines “mood color” 

cMood = lerp(0, cMood, saturate( fLum * 2.0 ) ); 
cMood = lerp(cMood, 1, saturate( fLum - 0.5 ) * 2.0 ); 

Final color is a blend between mood color and 
backbuffer based on luminance and user ratio 

final= lerp(cScreen, cMood , saturate( fLum * fRatio)); 



Conclusion 
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Awesome time to be working in real time 
computer graphics 

Hollywood movie image quality still far away 
Some challenges before we can reach it 

Need much more GPU power (Crysis is GPU bound) 

Order independent alpha blending - for real world 
cases 

Good reflection/refraction 

We still need to get rid of aliasing - Everywhere 
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Special Thanks 

To entire Crytek team 


All work presented here wouldn’t have been 
possible without your support © 
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Questions ? 



Tiago@Crytek.de 
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Color Grading 


• Additional comparison images 
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